Overview

Dataset statistics

Number of variables: 44
Number of observations: 398194
Missing cells: 515165
Missing cells (%): 2.9%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 133.7 MiB
Average record size in memory: 352.0 B
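
The percentage and per-record figures in the dataset statistics can be recomputed from the raw counts. The following is a minimal sketch in plain Python. It assumes (the report does not state this) that the missing-cell percentage is taken over variables times observations and that 1 MiB means 2**20 bytes.

```python
# Counts copied from the dataset statistics above.
n_variables = 44
n_observations = 398194
missing_cells = 515165
total_memory_mib = 133.7

# Share of missing cells over all cells (assumed: variables x observations).
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells

# Average bytes per record (assumed: 1 MiB = 2**20 bytes).
avg_record_bytes = total_memory_mib * 2**20 / n_observations

print(f"Missing cells (%): {missing_pct:.1f}%")
print(f"Average record size: {avg_record_bytes:.1f} B")
```

The recomputed record size can differ from the reported 352.0 B in the last digit, because the 133.7 MiB total is itself a rounded figure.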

Variable types

CAT: 24
NUM: 17
BOOL: 3

Reproduction

Analysis started: 2020-07-07 14:53:46.781985
Analysis finished: 2020-07-07 14:56:23.166784
Duration: 2 minutes and 36.38 seconds
Version: pandas-profiling v2.8.0
Command line: pandas_profiling --config_file config.yaml [YOUR_FILE.csv]
Download configuration: config.yaml
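
The reported duration can be checked against the two analysis timestamps. This is a small sketch using only the standard library. The format string is an assumption inferred from how the timestamps are printed in this report.

```python
from datetime import datetime

# Assumed format, inferred from the printed timestamps (microsecond precision).
FMT = "%Y-%m-%d %H:%M:%S.%f"

started = datetime.strptime("2020-07-07 14:53:46.781985", FMT)
finished = datetime.strptime("2020-07-07 14:56:23.166784", FMT)

duration = finished - started
minutes, seconds = divmod(duration.total_seconds(), 60)

# Print in the same wording the report uses for its Duration field.
print(f"{int(minutes)} minutes and {seconds:.2f} seconds")
```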

Warnings

INJURIES_UNKNOWN has constant value "0.0" Constant
CRASH_DATE has a high cardinality: 259780 distinct values High cardinality
DATE_POLICE_NOTIFIED has a high cardinality: 307208 distinct values High cardinality
STREET_NAME has a high cardinality: 1500 distinct values High cardinality
LOCATION has a high cardinality: 183467 distinct values High cardinality
Unnamed: 0 is highly correlated with df_index High correlation
df_index is highly correlated with Unnamed: 0 High correlation
LONGITUDE is highly correlated with LATITUDE High correlation
LATITUDE is highly correlated with LONGITUDE High correlation
LANE_CNT has 205498 (51.6%) missing values Missing
INTERSECTION_RELATED_I has 309667 (77.8%) missing values Missing
LANE_CNT is highly skewed (γ1 = 344.7025902) Skewed
LATITUDE is highly skewed (γ1 = -111.1077614) Skewed
LONGITUDE is highly skewed (γ1 = 120.2509699) Skewed
CRASH_DATE is uniformly distributed Uniform
DATE_POLICE_NOTIFIED is uniformly distributed Uniform
df_index has unique values Unique
Unnamed: 0 has unique values Unique
CRASH_RECORD_ID has unique values Unique
RD_NO has unique values Unique
POSTED_SPEED_LIMIT has 6421 (1.6%) zeros Zeros
LANE_CNT has 7792 (2.0%) zeros Zeros
INJURIES_TOTAL has 349923 (87.9%) zeros Zeros
INJURIES_INCAPACITATING has 392141 (98.5%) zeros Zeros
INJURIES_NON_INCAPACITATING has 370685 (93.1%) zeros Zeros
INJURIES_REPORTED_NOT_EVIDENT has 381101 (95.7%) zeros Zeros
INJURIES_NO_INDICATION has 6716 (1.7%) zeros Zeros
CRASH_HOUR has 7544 (1.9%) zeros Zeros
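
The percentages in the Missing and Zeros warnings can be cross-checked against the number of observations. The sketch below parses a few warning lines with a regular expression and recomputes each percentage. The pattern and the helper name `check_warning` are illustrative and are not part of pandas-profiling.

```python
import re

N_OBSERVATIONS = 398194  # "Number of observations" from the overview

# A few warning lines copied from the list above (without the trailing tag).
warnings = [
    "LANE_CNT has 205498 (51.6%) missing values",
    "INTERSECTION_RELATED_I has 309667 (77.8%) missing values",
    "INJURIES_TOTAL has 349923 (87.9%) zeros",
    "CRASH_HOUR has 7544 (1.9%) zeros",
]

# Hypothetical pattern for "<column> has <count> (<pct>%) missing values|zeros".
PATTERN = re.compile(r"^(\S+) has (\d+) \(([\d.]+)%\) (missing values|zeros)")


def check_warning(line, n_rows=N_OBSERVATIONS):
    """Return (column, kind, reported_pct, recomputed_pct) for one warning line."""
    match = PATTERN.match(line)
    if match is None:
        raise ValueError(f"unrecognised warning line: {line!r}")
    column, count, reported, kind = match.groups()
    recomputed = round(100 * int(count) / n_rows, 1)
    return column, kind, float(reported), recomputed


for line in warnings:
    column, kind, reported, recomputed = check_warning(line)
    print(f"{column:<24} {kind:<15} reported={reported} recomputed={recomputed}")
```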

Variables

df_index
Real number (ℝ≥0)

HIGH CORRELATION
UNIQUE

Distinct count: 398194
Unique (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 211161.53800911113
Minimum: 2
Maximum: 416197
Zeros: 0
Zeros (%): 0.0%
Memory size: 3.0 MiB

Quantile statistics

Minimum: 2
5-th percentile: 26830.65
Q1: 108821.25
Median: 211325.5
Q3: 313776.75
95-th percentile: 395693.35
Maximum: 416197
Range: 416195
Interquartile range (IQR): 204955.5

Descriptive statistics

Standard deviation: 118551.0808
Coefficient of variation (CV): 0.561423647
Kurtosis: -1.191530902
Mean: 211161.538
Median Absolute Deviation (MAD): 102477.5
Skewness: -0.006951055101
Sum: 8.408325747e+10
Variance: 1.405435875e+10
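
Several of the df_index figures are related by standard definitions: range is maximum minus minimum, IQR is Q3 minus Q1, CV is standard deviation over mean, and variance is the squared standard deviation. The sketch below recomputes them from the rounded values printed in the report. Whether pandas-profiling derives them in exactly this way internally is not stated here, so small differences in the last digits are possible.

```python
# Values copied from the df_index statistics above.
mean = 211161.538
std = 118551.0808
minimum, maximum = 2, 416197
q1, q3 = 108821.25, 313776.75

value_range = maximum - minimum   # Range
iqr = q3 - q1                     # Interquartile range (IQR)
cv = std / mean                   # Coefficient of variation (CV)
variance = std ** 2               # Variance

print("Range:", value_range)
print("IQR:", iqr)
print("CV:", round(cv, 6))
print("Variance:", f"{variance:.6e}")
```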
Histogram with fixed size bins (bins=10)

Value                    Count     Frequency (%)
2047                     1         < 0.1%
166660                   1         < 0.1%
383726                   1         < 0.1%
381679                   1         < 0.1%
338672                   1         < 0.1%
336625                   1         < 0.1%
342770                   1         < 0.1%
340723                   1         < 0.1%
330484                   1         < 0.1%
328437                   1         < 0.1%
Other values (398184)    398184    > 99.9%

Value                    Count     Frequency (%)
2                        1         < 0.1%
3                        1         < 0.1%
4                        1         < 0.1%
5                        1         < 0.1%
6                        1         < 0.1%

Value                    Count     Frequency (%)
416197                   1         < 0.1%
416196                   1         < 0.1%
416195                   1         < 0.1%
416194                   1         < 0.1%
416193                   1         < 0.1%
 

Unnamed: 0
Real number (ℝ≥0)

HIGH CORRELATION
UNIQUE

Distinct count: 398194
Unique (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 211161.53800911113
Minimum: 2
Maximum: 416197
Zeros: 0
Zeros (%): 0.0%
Memory size: 3.0 MiB

Quantile statistics

Minimum: 2
5-th percentile: 26830.65
Q1: 108821.25
Median: 211325.5
Q3: 313776.75
95-th percentile: 395693.35
Maximum: 416197
Range: 416195
Interquartile range (IQR): 204955.5

Descriptive statistics

Standard deviation: 118551.0808
Coefficient of variation (CV): 0.561423647
Kurtosis: -1.191530902
Mean: 211161.538
Median Absolute Deviation (MAD): 102477.5
Skewness: -0.006951055101
Sum: 8.408325747e+10
Variance: 1.405435875e+10

Histogram with fixed size bins (bins=10)

Value                    Count     Frequency (%)
2047                     1         < 0.1%
166660                   1         < 0.1%
383726                   1         < 0.1%
381679                   1         < 0.1%
338672                   1         < 0.1%
336625                   1         < 0.1%
342770                   1         < 0.1%
340723                   1         < 0.1%
330484                   1         < 0.1%
328437                   1         < 0.1%
Other values (398184)    398184    > 99.9%

Value                    Count     Frequency (%)
2                        1         < 0.1%
3                        1         < 0.1%
4                        1         < 0.1%
5                        1         < 0.1%
6                        1         < 0.1%

Value                    Count     Frequency (%)
416197                   1         < 0.1%
416196                   1         < 0.1%
416195                   1         < 0.1%
416194                   1         < 0.1%
416193                   1         < 0.1%
 

CRASH_RECORD_ID
Categorical

UNIQUE

Distinct count: 398194
Unique (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 3.0 MiB

ValueCountFrequency (%) 
47d4d22e0339b1bf3284cfdcf9d5ec325d09eba37ccc7b93f70f003cbe806cb4538f88fe9b155cb7d018836c6831df2b3cbddb1de5fc3c1aeb2d057a8c15dcae1< 0.1%
 
5415b00039c962b65cd6c6c84c78e5ded5b437281a1a0c95c67391caa2d1c39559c1391d0a56817e3f7a9ec8b7f4c8464fd4e89695c6065d3a4289f36c660e841< 0.1%
 
a72c2f85364749780f67f8d1bc65c03e0867fa4721ffd2e4ccb9fc18708ec8753104dd50db5698e058735c01b86031eb7f48add11f17bb16ca526fe9d6a8e3af1< 0.1%
 
6fe8ce9f41f01f78e653c434da2ff348b215bf78d995ecdd2d023aed86f1bdca339b0ef816ba36b9410ab9ecfa1f16eb9810e63ca5066b44c55adb7b8f8984e51< 0.1%
 
0bce0eff69612c8815fd62a76aebc08a778156f69586f69a9f88bbafeee149d2c7da268e354baa6ff56697a107bbebe05d72ec3a76841b01ae7ac6401152877a1< 0.1%
 
90e159d4d3ed879197cd1fa9a15a972f667bdd7ff2563d68aa2c661ab34035762e9bea6e0cec7573ad2e4936a2eb4f6ffc93b9a45f1b655005a120d1e9a791b61< 0.1%
 
7e2a202f2db2739a4a8d07ce6979fb3e47ad765552f373728f4d9dc51becd7668fd335fd7c0d5df4e9df7333ee4cb4ea182ece8f54c26d1f39227cac405f9ae01< 0.1%
 
c27b3981b5bdc1c770d8a360262bcbab413348c22265d5635e671b18e5ac391209d25da4aeb4acbbd4029e75ac02316a625d17a3b800dd1c099dc568d5b6040b1< 0.1%
 
145fd400a0a21f00b6db169626aaef39dff5a0b095846b68ce4170c89e0d4a7d7959270f6f137341b10365bbc145f5d9087d9d8c2a8bfe0d09c1ec228745d64e1< 0.1%
 
340070be8386feac0c79b6ba4ae13ef2c11caaa1770857254bda4656e3553089ca5462f922838147cbe4d1c57a503230e52637948f9823c2b38d0a28644b08331< 0.1%
 
Other values (398184)398184> 99.9%
 

Length

Max length: 128
Median length: 128
Mean length: 128
Min length: 128

RD_NO
Categorical

UNIQUE

Distinct count: 398194
Unique (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 3.0 MiB

Value                    Count     Frequency (%)
JB125535                 1         < 0.1%
JA569371                 1         < 0.1%
JD164013                 1         < 0.1%
JA333494                 1         < 0.1%
JD148417                 1         < 0.1%
JC147248                 1         < 0.1%
JB312556                 1         < 0.1%
JD139137                 1         < 0.1%
JA238695                 1         < 0.1%
JC422545                 1         < 0.1%
Other values (398184)    398184    > 99.9%

Length

Max length: 8
Median length: 8
Mean length: 8
Min length: 8

CRASH_DATE
Categorical

HIGH CARDINALITY
UNIFORM

Distinct count: 259780
Unique (%): 65.2%
Missing: 0
Missing (%): 0.0%
Memory size: 3.0 MiB

Value                     Count     Frequency (%)
11/10/2017 10:30:00 AM    26        < 0.1%
11/10/2017 10:00:00 AM    20        < 0.1%
01/12/2019 02:30:00 PM    18        < 0.1%
01/12/2019 03:00:00 PM    18        < 0.1%
01/12/2019 02:00:00 PM    17        < 0.1%
09/04/2018 08:00:00 AM    16        < 0.1%
02/26/2020 08:00:00 AM    15        < 0.1%
01/12/2019 04:00:00 PM    15        < 0.1%
02/26/2020 07:45:00 AM    15        < 0.1%
02/10/2019 02:00:00 PM    15        < 0.1%
Other values (259770)     398019    > 99.9%

Length

Max length: 22
Median length: 22
Mean length: 22
Min length: 22
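
CRASH_DATE is profiled as Categorical with a fixed length of 22 characters, which suggests the column holds timestamp strings rather than parsed dates. The sketch below shows one way such strings could be parsed with the standard library. The format string is inferred from the sample values (for example "02/26/2020 08:00:00 AM", where the second field exceeds 12 and so must be the day), and the helper name `parse_crash_date` is hypothetical.

```python
from datetime import datetime

# Format inferred from the values above: month/day/year, 12-hour clock, AM/PM.
CRASH_DATE_FORMAT = "%m/%d/%Y %I:%M:%S %p"


def parse_crash_date(text):
    """Parse one CRASH_DATE string into a datetime object."""
    return datetime.strptime(text, CRASH_DATE_FORMAT)


samples = ["11/10/2017 10:30:00 AM", "01/12/2019 02:30:00 PM"]
for text in samples:
    parsed = parse_crash_date(text)
    print(text, "->", parsed.isoformat(), "hour:", parsed.hour)
```

Note that the `%p` directive is locale-dependent, so this sketch assumes a locale in which "AM" and "PM" are the recognised markers.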

POSTED_SPEED_LIMIT
Real number (ℝ≥0)

ZEROS

Distinct count15
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean28.260358016444247
Minimum0.0
Maximum70.0
Zeros6421
Zeros (%)1.6%
Memory size3.0 MiB

Quantile statistics

Minimum0
5-th percentile15
Q130
median30
Q330
95-th percentile35
Maximum70
Range70
Interquartile range (IQR)0

Descriptive statistics

Standard deviation6.49276033
Coefficient of variation (CV)0.2297479857
Kurtosis6.834174355
Mean28.26035802
Median Absolute Deviation (MAD)0
Skewness-2.217937382
Sum11253105
Variance42.1559367
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
3029463574.0%
 
35270956.8%
 
25238226.0%
 
20150823.8%
 
15135683.4%
 
1080262.0%
 
064211.6%
 
4035840.9%
 
532700.8%
 
4522810.6%
 
Other values (5)4100.1%
 
ValueCountFrequency (%) 
064211.6%
 
532700.8%
 
1080262.0%
 
15135683.4%
 
20150823.8%
 
ValueCountFrequency (%) 
703< 0.1%
 
656< 0.1%
 
6023< 0.1%
 
553050.1%
 
5073< 0.1%
 
Distinct count19
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size3.0 MiB
NO CONTROLS
229331
TRAFFIC SIGNAL
111086
STOP SIGN/FLASHER
 
39132
UNKNOWN
 
12848
OTHER
 
2319
Other values (14)
 
3478
ValueCountFrequency (%) 
NO CONTROLS22933157.6%
 
TRAFFIC SIGNAL11108627.9%
 
STOP SIGN/FLASHER391329.8%
 
UNKNOWN128483.2%
 
OTHER23190.6%
 
LANE USE MARKING11410.3%
 
YIELD5510.1%
 
OTHER REG. SIGN3750.1%
 
OTHER WARNING SIGN3530.1%
 
RAILROAD CROSSING GATE2760.1%
 
Other values (9)7820.2%
 

Length

Max length24
Median length11
Mean length12.29784226
Min length5

DEVICE_CONDITION
Categorical

Distinct count8
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size3.0 MiB
NO CONTROLS
231548
FUNCTIONING PROPERLY
138279
UNKNOWN
 
21575
OTHER
 
3030
FUNCTIONING IMPROPERLY
 
2230
Other values (3)
 
1532
ValueCountFrequency (%) 
NO CONTROLS23154858.1%
 
FUNCTIONING PROPERLY13827934.7%
 
UNKNOWN215755.4%
 
OTHER30300.8%
 
FUNCTIONING IMPROPERLY22300.6%
 
NOT FUNCTIONING12920.3%
 
WORN REFLECTIVE MATERIAL188< 0.1%
 
MISSING52< 0.1%
 

Length

Max length24
Median length11
Mean length13.94320105
Min length5
Distinct count12
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size3.0 MiB
CLEAR
314030
RAIN
 
36485
UNKNOWN
 
17212
SNOW
 
15074
CLOUDY/OVERCAST
 
12242
Other values (7)
 
3151
ValueCountFrequency (%) 
CLEAR31403078.9%
 
RAIN364859.2%
 
UNKNOWN172124.3%
 
SNOW150743.8%
 
CLOUDY/OVERCAST122423.1%
 
OTHER12710.3%
 
FOG/SMOKE/HAZE7560.2%
 
SLEET/HAIL6260.2%
 
FREEZING RAIN/DRIZZLE3600.1%
 
SEVERE CROSS WIND GATE78< 0.1%
 
Other values (2)60< 0.1%
 

Length

Max length24
Median length5
Mean length5.308234177
Min length4
Distinct count6
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size3.0 MiB
DAYLIGHT
261434
DARKNESS, LIGHTED ROAD
83035
DARKNESS
 
20209
UNKNOWN
 
14392
DUSK
 
12206
ValueCountFrequency (%) 
DAYLIGHT26143465.7%
 
DARKNESS, LIGHTED ROAD8303520.9%
 
DARKNESS202095.1%
 
UNKNOWN143923.6%
 
DUSK122063.1%
 
DAWN69181.7%
 

Length

Max length22
Median length8
Mean length10.69115557
Min length4

FIRST_CRASH_TYPE
Categorical

Distinct count18
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size3.0 MiB
REAR END
97874
PARKED MOTOR VEHICLE
89406
SIDESWIPE SAME DIRECTION
63545
TURNING
55538
ANGLE
41640
Other values (13)
50191
ValueCountFrequency (%) 
REAR END9787424.6%
 
PARKED MOTOR VEHICLE8940622.5%
 
SIDESWIPE SAME DIRECTION6354516.0%
 
TURNING5553813.9%
 
ANGLE4164010.5%
 
FIXED OBJECT173974.4%
 
PEDESTRIAN93382.3%
 
SIDESWIPE OPPOSITE DIRECTION58781.5%
 
PEDALCYCLIST57011.4%
 
OTHER OBJECT36210.9%
 
Other values (8)82562.1%
 

Length

Max length28
Median length10
Mean length13.46595629
Min length5

TRAFFICWAY_TYPE
Categorical

Distinct count20
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size3.0 MiB
NOT DIVIDED
180096
DIVIDED - W/MEDIAN (NOT RAISED)
72633
ONE-WAY
52598
PARKING LOT
 
28698
DIVIDED - W/MEDIAN BARRIER
 
23959
Other values (15)
40210
ValueCountFrequency (%) 
NOT DIVIDED18009645.2%
 
DIVIDED - W/MEDIAN (NOT RAISED)7263318.2%
 
ONE-WAY5259813.2%
 
PARKING LOT286987.2%
 
DIVIDED - W/MEDIAN BARRIER239596.0%
 
OTHER114282.9%
 
FOUR WAY86572.2%
 
ALLEY64171.6%
 
UNKNOWN43211.1%
 
CENTER TURN LANE35290.9%
 
Other values (10)58581.5%
 

Length

Max length31
Median length11
Mean length14.69813207
Min length4

LANE_CNT
Real number (ℝ≥0)

MISSING
SKEWED
ZEROS

Distinct count: 40
Unique (%): < 0.1%
Missing: 205498
Missing (%): 51.6%
Infinite: 0
Infinite (%): 0.0%
Mean: 13.681062398804334
Minimum: 0.0
Maximum: 1191625.0
Zeros: 7792
Zeros (%): 2.0%
Memory size: 3.0 MiB

Quantile statistics

Minimum: 0
5-th percentile: 1
Q1: 2
Median: 2
Q3: 4
95-th percentile: 4
Maximum: 1191625
Range: 1191625
Interquartile range (IQR): 2

Descriptive statistics

Standard deviation: 3009.722933
Coefficient of variation (CV): 219.9919016
Kurtosis: 130406.2615
Mean: 13.6810624
Median Absolute Deviation (MAD): 1
Skewness: 344.7025902
Sum: 2636286
Variance: 9058432.131

Histogram with fixed size bins (bins=10)

Value                Count     Frequency (%)
2                    88259     22.2%
4                    48092     12.1%
1                    31620     7.9%
3                    8262      2.1%
0                    7792      2.0%
6                    4324      1.1%
5                    1881      0.5%
8                    1851      0.5%
7                    179       < 0.1%
10                   151       < 0.1%
Other values (30)    285       0.1%
(Missing)            205498    51.6%

Value                Count     Frequency (%)
0                    7792      2.0%
1                    31620     7.9%
2                    88259     22.2%
3                    8262      2.1%
4                    48092     12.1%
 
ValueCountFrequency (%) 
11916251< 0.1%
 
4336341< 0.1%
 
2996791< 0.1%
 
2184741< 0.1%
 
9021< 0.1%
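
The gap between the 95-th percentile (4) and the maximum (1191625) suggests that a handful of extreme entries drive the mean, standard deviation and skewness of LANE_CNT. The sketch below illustrates the effect on a small made-up sample. The cut-off `MAX_PLAUSIBLE_LANES` is a hypothetical threshold chosen for illustration, not a value from the report, and the sample is not the real column.

```python
from statistics import mean, median

# Hypothetical cut-off for a plausible lane count; not taken from the report.
MAX_PLAUSIBLE_LANES = 12

# Illustrative sample only: typical values plus four of the extreme values
# listed above. It is not the real column.
lane_cnt = [2, 2, 4, 1, 2, 3, 4, 0, 2, 6, 1191625, 433634, 299679, 218474]

# Split the sample into plausible and suspect entries.
plausible = [v for v in lane_cnt if v <= MAX_PLAUSIBLE_LANES]
suspect = [v for v in lane_cnt if v > MAX_PLAUSIBLE_LANES]

print("mean with outliers:   ", mean(lane_cnt))
print("median with outliers: ", median(lane_cnt))
print("mean without outliers:", mean(plausible))
print("suspect values:       ", suspect)
```

The median barely moves when the extreme values are present, which is one reason it is often preferred over the mean for a column like this.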
 

ALIGNMENT
Categorical

Distinct count6
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size3.0 MiB
STRAIGHT AND LEVEL
388465
STRAIGHT ON GRADE
 
4801
CURVE, LEVEL
 
2915
STRAIGHT ON HILLCREST
 
1265
CURVE ON GRADE
 
554
ValueCountFrequency (%) 
STRAIGHT AND LEVEL38846597.6%
 
STRAIGHT ON GRADE48011.2%
 
CURVE, LEVEL29150.7%
 
STRAIGHT ON HILLCREST12650.3%
 
CURVE ON GRADE5540.1%
 
CURVE ON HILLCREST194< 0.1%
 

Length

Max length21
Median length18
Mean length17.94798515
Min length12
Distinct count7
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size3.0 MiB
DRY
297147
WET
 
56060
UNKNOWN
 
26201
SNOW OR SLUSH
 
14378
ICE
 
3288
Other values (2)
 
1120
ValueCountFrequency (%) 
DRY29714774.6%
 
WET5606014.1%
 
UNKNOWN262016.6%
 
SNOW OR SLUSH143783.6%
 
ICE32880.8%
 
OTHER9270.2%
 
SAND, MUD, DIRT193< 0.1%
 

Length

Max length15
Median length3
Mean length3.6347509
Min length3

ROAD_DEFECT
Categorical

Distinct count7
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size3.0 MiB
NO DEFECTS
331841
UNKNOWN
 
57079
RUT, HOLES
 
4085
OTHER
 
2304
WORN SURFACE
 
1664
Other values (2)
 
1221
ValueCountFrequency (%) 
NO DEFECTS33184183.3%
 
UNKNOWN5707914.3%
 
RUT, HOLES40851.0%
 
OTHER23040.6%
 
WORN SURFACE16640.4%
 
SHOULDER DEFECT8660.2%
 
DEBRIS ON ROADWAY3550.1%
 

Length

Max length17
Median length10
Mean length9.566507783
Min length5

REPORT_TYPE
Categorical

Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size3.0 MiB
NOT ON SCENE (DESK REPORT)
243549
ON SCENE
154645
ValueCountFrequency (%) 
NOT ON SCENE (DESK REPORT)24354961.2%
 
ON SCENE15464538.8%
 

Length

Max length26
Median length26
Mean length19.0094125
Min length8

CRASH_TYPE
Categorical

Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size3.0 MiB
NO INJURY / DRIVE AWAY
306031
INJURY AND / OR TOW DUE TO CRASH
92163
ValueCountFrequency (%) 
NO INJURY / DRIVE AWAY30603176.9%
 
INJURY AND / OR TOW DUE TO CRASH9216323.1%
 

Length

Max length32
Median length22
Mean length24.31452508
Min length22

INTERSECTION_RELATED_I

Distinct count: 2
Unique (%): < 0.1%
Missing: 309667
Missing (%): 77.8%
Memory size: 3.0 MiB

Value        Count     Frequency (%)
Y            84320     21.2%
N            4207      1.1%
(Missing)    309667    77.8%
 
Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size3.0 MiB
0
290995
1
107199
ValueCountFrequency (%) 
029099573.1%
 
110719926.9%
 

DAMAGE
Categorical

Distinct count3
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size3.0 MiB
OVER $1,500
225505
$501 - $1,500
119715
$500 OR LESS
52974
ValueCountFrequency (%) 
OVER $1,50022550556.6%
 
$501 - $1,50011971530.1%
 
$500 OR LESS5297413.3%
 

Length

Max length13
Median length11
Mean length11.73432548
Min length11

DATE_POLICE_NOTIFIED
Categorical

HIGH CARDINALITY
UNIFORM

Distinct count307208
Unique (%)77.2%
Missing0
Missing (%)0.0%
Memory size3.0 MiB
02/26/2020 08:30:00 AM
 
12
06/30/2018 09:30:00 PM
 
11
02/14/2020 05:00:00 PM
 
11
05/25/2018 06:00:00 PM
 
10
06/03/2019 06:00:00 PM
 
10
Other values (307203)
398140
ValueCountFrequency (%) 
02/26/2020 08:30:00 AM12< 0.1%
 
06/30/2018 09:30:00 PM11< 0.1%
 
02/14/2020 05:00:00 PM11< 0.1%
 
05/25/2018 06:00:00 PM10< 0.1%
 
06/03/2019 06:00:00 PM10< 0.1%
 
10/25/2017 04:30:00 PM10< 0.1%
 
05/31/2019 04:00:00 PM10< 0.1%
 
05/01/2018 05:00:00 PM10< 0.1%
 
09/13/2019 05:00:00 PM10< 0.1%
 
02/05/2020 05:00:00 PM10< 0.1%
 
Other values (307198)398090> 99.9%
 

Length

Max length22
Median length22
Mean length22
Min length22
Distinct count40
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size3.0 MiB
UNABLE TO DETERMINE
144644
FAILING TO YIELD RIGHT-OF-WAY
44560
FOLLOWING TOO CLOSELY
44122
NOT APPLICABLE
 
21300
IMPROPER OVERTAKING/PASSING
 
19222
Other values (35)
124346
ValueCountFrequency (%) 
UNABLE TO DETERMINE14464436.3%
 
FAILING TO YIELD RIGHT-OF-WAY4456011.2%
 
FOLLOWING TOO CLOSELY4412211.1%
 
NOT APPLICABLE213005.3%
 
IMPROPER OVERTAKING/PASSING192224.8%
 
IMPROPER BACKING180454.5%
 
FAILING TO REDUCE SPEED TO AVOID CRASH168024.2%
 
IMPROPER LANE USAGE159444.0%
 
IMPROPER TURNING/NO SIGNAL134573.4%
 
DRIVING SKILLS/KNOWLEDGE/EXPERIENCE124483.1%
 
Other values (30)4765012.0%
 

Length

Max length80
Median length19
Mean length23.708195
Min length6
Distinct count40
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size3.0 MiB
NOT APPLICABLE
159708
UNABLE TO DETERMINE
141731
FAILING TO REDUCE SPEED TO AVOID CRASH
 
16766
DRIVING SKILLS/KNOWLEDGE/EXPERIENCE
 
12714
FAILING TO YIELD RIGHT-OF-WAY
 
12298
Other values (35)
54977
ValueCountFrequency (%) 
NOT APPLICABLE15970840.1%
 
UNABLE TO DETERMINE14173135.6%
 
FAILING TO REDUCE SPEED TO AVOID CRASH167664.2%
 
DRIVING SKILLS/KNOWLEDGE/EXPERIENCE127143.2%
 
FAILING TO YIELD RIGHT-OF-WAY122983.1%
 
FOLLOWING TOO CLOSELY113872.9%
 
IMPROPER OVERTAKING/PASSING59801.5%
 
IMPROPER LANE USAGE59211.5%
 
WEATHER51041.3%
 
IMPROPER TURNING/NO SIGNAL40171.0%
 
Other values (30)225685.7%
 

Length

Max length80
Median length19
Mean length19.71373752
Min length6

STREET_NO
Real number (ℝ≥0)

Distinct count10786
Unique (%)2.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean3598.388516652687
Minimum1
Maximum13799
Zeros0
Zeros (%)0.0%
Memory size3.0 MiB

Quantile statistics

Minimum1
5-th percentile138
Q11200
median3119
Q35501
95-th percentile8723
Maximum13799
Range13798
Interquartile range (IQR)4301

Descriptive statistics

Standard deviation2815.384209
Coefficient of variation (CV)0.7824013988
Kurtosis0.02470885059
Mean3598.388517
Median Absolute Deviation (MAD)2114
Skewness0.7645571692
Sum1432856717
Variance7926388.243
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
10027020.7%
 
160025170.6%
 
20024220.6%
 
80024030.6%
 
30022630.6%
 
50021370.5%
 
630020280.5%
 
60020060.5%
 
120019810.5%
 
470019630.5%
 
Other values (10776)37577294.4%
 
ValueCountFrequency (%) 
114810.4%
 
27750.2%
 
3122< 0.1%
 
479< 0.1%
 
5196< 0.1%
 
ValueCountFrequency (%) 
137993< 0.1%
 
137801< 0.1%
 
1377013< 0.1%
 
137621< 0.1%
 
137581< 0.1%
 

STREET_DIRECTION
Categorical

Distinct count4
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size3.0 MiB
W
142252
S
129803
N
98675
E
 
27464
ValueCountFrequency (%) 
W14225235.7%
 
S12980332.6%
 
N9867524.8%
 
E274646.9%
 

Length

Max length1
Median length1
Mean length1
Min length1

STREET_NAME
Categorical

HIGH CARDINALITY

Distinct count1500
Unique (%)0.4%
Missing0
Missing (%)0.0%
Memory size3.0 MiB
WESTERN AVE
 
10979
PULASKI RD
 
9489
CICERO AVE
 
8694
ASHLAND AVE
 
8620
HALSTED ST
 
7570
Other values (1495)
352842
ValueCountFrequency (%) 
WESTERN AVE109792.8%
 
PULASKI RD94892.4%
 
CICERO AVE86942.2%
 
ASHLAND AVE86202.2%
 
HALSTED ST75701.9%
 
KEDZIE AVE67231.7%
 
MICHIGAN AVE55721.4%
 
STATE ST49111.2%
 
CLARK ST47711.2%
 
NORTH AVE47581.2%
 
Other values (1490)32610781.9%
 

Length

Max length27
Median length10
Mean length10.67221254
Min length5

BEAT_OF_OCCURRENCE
Real number (ℝ≥0)

Distinct count271
Unique (%)0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1248.675286418178
Minimum111.0
Maximum2535.0
Zeros0
Zeros (%)0.0%
Memory size3.0 MiB

Quantile statistics

Minimum111
5-th percentile124
Q1715
median1214
Q31824
95-th percentile2512
Maximum2535
Range2424
Interquartile range (IQR)1109

Descriptive statistics

Standard deviation709.6164418
Coefficient of variation (CV)0.5682954164
Kurtosis-1.023902108
Mean1248.675286
Median Absolute Deviation (MAD)590
Skewness0.1351788021
Sum497215007
Variance503555.4945
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
183456431.4%
 
12245671.1%
 
11445321.1%
 
183145221.1%
 
81340261.0%
 
81537750.9%
 
83332280.8%
 
241331070.8%
 
123230540.8%
 
83429610.7%
 
Other values (261)35877990.1%
 
ValueCountFrequency (%) 
11121440.5%
 
11215800.4%
 
11310910.3%
 
11445321.1%
 
12123820.6%
 
ValueCountFrequency (%) 
253512330.3%
 
253416330.4%
 
253326020.7%
 
253210430.3%
 
253110710.3%
 

NUM_UNITS
Real number (ℝ≥0)

Distinct count14
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2.0218084652204706
Minimum1.0
Maximum18.0
Zeros0
Zeros (%)0.0%
Memory size3.0 MiB

Quantile statistics

Minimum1
5-th percentile1
Q12
median2
Q32
95-th percentile3
Maximum18
Range17
Interquartile range (IQR)0

Descriptive statistics

Standard deviation0.4139876489
Coefficient of variation (CV)0.2047610622
Kurtosis37.17641388
Mean2.021808465
Median Absolute Deviation (MAD)0
Skewness3.077377139
Sum805072
Variance0.1713857735
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
235345288.8%
 
1210665.3%
 
3192674.8%
 
433150.8%
 
57600.2%
 
62060.1%
 
778< 0.1%
 
825< 0.1%
 
911< 0.1%
 
107< 0.1%
 
Other values (4)7< 0.1%
 
ValueCountFrequency (%) 
1210665.3%
 
235345288.8%
 
3192674.8%
 
433150.8%
 
57600.2%
 
ValueCountFrequency (%) 
181< 0.1%
 
151< 0.1%
 
122< 0.1%
 
113< 0.1%
 
107< 0.1%
 
Distinct count5
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size3.0 MiB
NO INDICATION OF INJURY
349923
NONINCAPACITATING INJURY
 
26484
REPORTED, NOT EVIDENT
 
15490
INCAPACITATING INJURY
 
6013
FATAL
 
284
ValueCountFrequency (%) 
NO INDICATION OF INJURY34992387.9%
 
NONINCAPACITATING INJURY264846.7%
 
REPORTED, NOT EVIDENT154903.9%
 
INCAPACITATING INJURY60131.5%
 
FATAL2840.1%
 

Length

Max length24
Median length23
Mean length22.9456697
Min length5

INJURIES_TOTAL
Real number (ℝ≥0)

ZEROS

Distinct count17
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.16355595513744556
Minimum0.0
Maximum21.0
Zeros349923
Zeros (%)87.9%
Memory size3.0 MiB

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q30
95-th percentile1
Maximum21
Range21
Interquartile range (IQR)0

Descriptive statistics

Standard deviation0.5221584262
Coefficient of variation (CV)3.192536926
Kurtosis56.07949933
Mean0.1635559551
Median Absolute Deviation (MAD)0
Skewness5.26015217
Sum65127
Variance0.272649422
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
034992387.9%
 
1371849.3%
 
275521.9%
 
322420.6%
 
47960.2%
 
52960.1%
 
6110< 0.1%
 
744< 0.1%
 
814< 0.1%
 
914< 0.1%
 
Other values (7)19< 0.1%
 
ValueCountFrequency (%) 
034992387.9%
 
1371849.3%
 
275521.9%
 
322420.6%
 
47960.2%
 
ValueCountFrequency (%) 
212< 0.1%
 
191< 0.1%
 
161< 0.1%
 
151< 0.1%
 
132< 0.1%
 

INJURIES_FATAL
Categorical

Distinct count4
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size3.0 MiB
0
397910
1
 
265
2
 
15
3
 
4
ValueCountFrequency (%) 
039791099.9%
 
12650.1%
 
215< 0.1%
 
34< 0.1%
 

Length

Max length3
Median length3
Mean length3
Min length3

INJURIES_INCAPACITATING
Real number (ℝ≥0)

ZEROS

Distinct count8
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.01762708629462021
Minimum0.0
Maximum7.0
Zeros392141
Zeros (%)98.5%
Memory size3.0 MiB

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q30
95-th percentile0
Maximum7
Range7
Interquartile range (IQR)0

Descriptive statistics

Standard deviation0.1546000071
Coefficient of variation (CV)8.770593423
Kurtosis190.3613331
Mean0.01762708629
Median Absolute Deviation (MAD)0
Skewness11.63650857
Sum7019
Variance0.0239011622
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
039214198.5%
 
153431.3%
 
25250.1%
 
3128< 0.1%
 
446< 0.1%
 
59< 0.1%
 
71< 0.1%
 
61< 0.1%
 
ValueCountFrequency (%) 
039214198.5%
 
153431.3%
 
25250.1%
 
3128< 0.1%
 
446< 0.1%
 
ValueCountFrequency (%) 
71< 0.1%
 
61< 0.1%
 
59< 0.1%
 
446< 0.1%
 
3128< 0.1%
 

INJURIES_NON_INCAPACITATING
Real number (ℝ≥0)

ZEROS

Distinct count16
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.0896146099639874
Minimum0.0
Maximum21.0
Zeros370685
Zeros (%)93.1%
Memory size3.0 MiB

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q30
95-th percentile1
Maximum21
Range21
Interquartile range (IQR)0

Descriptive statistics

Standard deviation0.3846934118
Coefficient of variation (CV)4.29275329
Kurtosis126.8850279
Mean0.08961460996
Median Absolute Deviation (MAD)0
Skewness7.454393917
Sum35684
Variance0.147989021
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
037068593.1%
 
1220935.5%
 
237130.9%
 
310960.3%
 
43840.1%
 
5130< 0.1%
 
653< 0.1%
 
719< 0.1%
 
105< 0.1%
 
85< 0.1%
 
Other values (6)11< 0.1%
 
ValueCountFrequency (%) 
037068593.1%
 
1220935.5%
 
237130.9%
 
310960.3%
 
43840.1%
 
ValueCountFrequency (%) 
212< 0.1%
 
181< 0.1%
 
161< 0.1%
 
141< 0.1%
 
113< 0.1%
 

INJURIES_REPORTED_NOT_EVIDENT
Real number (ℝ≥0)

ZEROS

Distinct count11
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.055543277899717226
Minimum0.0
Maximum10.0
Zeros381101
Zeros (%)95.7%
Memory size3.0 MiB

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q30
95-th percentile0
Maximum10
Range10
Interquartile range (IQR)0

Descriptive statistics

Standard deviation0.2996112402
Coefficient of variation (CV)5.394194429
Kurtosis96.3037381
Mean0.0555432779
Median Absolute Deviation (MAD)0
Skewness7.909177094
Sum22117
Variance0.08976689526
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
038110195.7%
 
1135633.4%
 
225650.6%
 
36450.2%
 
4198< 0.1%
 
582< 0.1%
 
617< 0.1%
 
710< 0.1%
 
85< 0.1%
 
95< 0.1%
 
ValueCountFrequency (%) 
038110195.7%
 
1135633.4%
 
225650.6%
 
36450.2%
 
4198< 0.1%
 
ValueCountFrequency (%) 
103< 0.1%
 
95< 0.1%
 
85< 0.1%
 
710< 0.1%
 
617< 0.1%
 

INJURIES_NO_INDICATION
Real number (ℝ≥0)

ZEROS

Distinct count42
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2.0195507717343806
Minimum0.0
Maximum61.0
Zeros6716
Zeros (%)1.7%
Memory size3.0 MiB

Quantile statistics

Minimum0
5-th percentile1
Q11
median2
Q32
95-th percentile4
Maximum61
Range61
Interquartile range (IQR)1

Descriptive statistics

Standard deviation1.15556234
Coefficient of variation (CV)0.5721878131
Kurtosis91.15031613
Mean2.019550772
Median Absolute Deviation (MAD)1
Skewness4.371242876
Sum804173
Variance1.335324321
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
219171048.1%
 
111803229.6%
 
34955212.4%
 
4186684.7%
 
578082.0%
 
067161.7%
 
633130.8%
 
712360.3%
 
85660.1%
 
92360.1%
 
Other values (32)3570.1%
 
ValueCountFrequency (%) 
067161.7%
 
111803229.6%
 
219171048.1%
 
34955212.4%
 
4186684.7%
 
ValueCountFrequency (%) 
611< 0.1%
 
501< 0.1%
 
451< 0.1%
 
423< 0.1%
 
402< 0.1%
 

INJURIES_UNKNOWN
Boolean

CONSTANT
REJECTED

Distinct count1
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size3.0 MiB
0
398194
ValueCountFrequency (%) 
0398194100.0%
 

CRASH_HOUR
Real number (ℝ≥0)

ZEROS

Distinct count24
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean13.192619677845473
Minimum0
Maximum23
Zeros7544
Zeros (%)1.9%
Memory size3.0 MiB

Quantile statistics

Minimum0
5-th percentile3
Q19
median14
Q317
95-th percentile22
Maximum23
Range23
Interquartile range (IQR)8

Descriptive statistics

Standard deviation5.45022083
Coefficient of variation (CV)0.4131265028
Kurtosis-0.3903240384
Mean13.19261968
Median Absolute Deviation (MAD)4
Skewness-0.3950513913
Sum5253222
Variance29.70490709
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
16304997.7%
 
15304827.7%
 
17301737.6%
 
14268606.7%
 
18248906.3%
 
13243906.1%
 
12234335.9%
 
8226765.7%
 
11201575.1%
 
9192804.8%
 
Other values (14)14535436.5%
 
ValueCountFrequency (%) 
075441.9%
 
163751.6%
 
256881.4%
 
346731.2%
 
443541.1%
 
ValueCountFrequency (%) 
2394932.4%
 
22115332.9%
 
21124453.1%
 
20139423.5%
 
19178714.5%
 

CRASH_DAY_OF_WEEK
Real number (ℝ≥0)

Distinct count7
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean4.124816044440649
Minimum1
Maximum7
Zeros0
Zeros (%)0.0%
Memory size3.0 MiB

Quantile statistics

Minimum1
5-th percentile1
Q12
median4
Q36
95-th percentile7
Maximum7
Range6
Interquartile range (IQR)4

Descriptive statistics

Standard deviation1.967291944
Coefficient of variation (CV)0.4769405284
Kurtosis-1.228995325
Mean4.124816044
Median Absolute Deviation (MAD)2
Skewness-0.07200053495
Sum1642477
Variance3.870237593
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
66475416.3%
 
75813214.6%
 
35788514.5%
 
55733414.4%
 
45702614.3%
 
25553713.9%
 
14752611.9%
 
ValueCountFrequency (%) 
14752611.9%
 
25553713.9%
 
35788514.5%
 
45702614.3%
 
55733414.4%
 
ValueCountFrequency (%) 
75813214.6%
 
66475416.3%
 
55733414.4%
 
45702614.3%
 
35788514.5%
 

CRASH_MONTH
Real number (ℝ≥0)

Distinct count12
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean6.6087183634108
Minimum1
Maximum12
Zeros0
Zeros (%)0.0%
Memory size3.0 MiB

Quantile statistics

Minimum1
5-th percentile1
Q14
median7
Q310
95-th percentile12
Maximum12
Range11
Interquartile range (IQR)6

Descriptive statistics

Standard deviation3.484479819
Coefficient of variation (CV)0.5272550028
Kurtosis-1.239390955
Mean6.608718363
Median Absolute Deviation (MAD)3
Skewness-0.03985805496
Sum2631552
Variance12.14159961
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
10367129.2%
 
12356549.0%
 
5354808.9%
 
11346418.7%
 
9337828.5%
 
1331338.3%
 
6328518.2%
 
3323948.1%
 
2317898.0%
 
8315767.9%
 
Other values (2)6018215.1%
 
ValueCountFrequency (%) 
1331338.3%
 
2317898.0%
 
3323948.1%
 
4302627.6%
 
5354808.9%
 
ValueCountFrequency (%) 
12356549.0%
 
11346418.7%
 
10367129.2%
 
9337828.5%
 
8315767.9%
 

LATITUDE
Real number (ℝ≥0)

HIGH CORRELATION
SKEWED

Distinct count: 183392
Unique (%): 46.1%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 41.85757696026045
Minimum: 0.0
Maximum: 42.022779861
Zeros: 27
Zeros (%): < 0.1%
Memory size: 3.0 MiB

Quantile statistics

Minimum: 0
5-th percentile: 41.71467264
Q1: 41.78687426
Median: 41.87775187
Q3: 41.92487836
95-th percentile: 41.99037309
Maximum: 42.02277986
Range: 42.02277986
Interquartile range (IQR): 0.1380040987

Descriptive statistics

Standard deviation: 0.3550475061
Coefficient of variation (CV): 0.008482275656
Kurtosis: 13095.67571
Mean: 41.85757696
Median Absolute Deviation (MAD): 0.0674057415
Skewness: -111.1077614
Sum: 16667436
Variance: 0.1260587316

Histogram with fixed size bins (bins=10)

Value                    Count     Frequency (%)
41.97620114              533       0.1%
41.79142028              266       0.1%
41.7514606               260       0.1%
41.72225727              212       0.1%
41.78932932              200       0.1%
41.75466012              184       < 0.1%
41.90095892              170       < 0.1%
41.74257762              146       < 0.1%
41.73638005              140       < 0.1%
41.79291088              138       < 0.1%
Other values (183382)    395945    99.4%
 
ValueCountFrequency (%) 
027< 0.1%
 
41.6446701311< 0.1%
 
41.644691522< 0.1%
 
41.644693975< 0.1%
 
41.644712321< 0.1%
 
ValueCountFrequency (%) 
42.022779863< 0.1%
 
42.022736321< 0.1%
 
42.022720171< 0.1%
 
42.022661142< 0.1%
 
42.022660272< 0.1%
 

LONGITUDE
Real number (ℝ)

HIGH CORRELATION
SKEWED

Distinct count183382
Unique (%)46.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean-87.67233841457879
Minimum-87.934014222
Maximum0.0
Zeros27
Zeros (%)< 0.1%
Memory size3.0 MiB

Quantile statistics

Minimum                    -87.93401422
5-th percentile            -87.7763876
Q1                         -87.72075637
median                     -87.67297088
Q3                         -87.63301193
95-th percentile           -87.58618584
Maximum                    0
Range                      87.93401422
Interquartile range (IQR)  0.08774443425

Descriptive statistics

Standard deviation               0.7243082962
Coefficient of variation (CV)    -0.008261537326
Kurtosis                         14552.70392
Mean                             -87.67233841
Median Absolute Deviation (MAD)  0.04236988
Skewness                         120.2509699
Sum                              -34910599.12
Variance                         0.524622508
Histogram with fixed size bins (bins=10)
Value                  Count   Frequency (%)
-87.90530913           533     0.1%
-87.58014777           266     0.1%
-87.58597199           260     0.1%
-87.58527557           212     0.1%
-87.74164564           200     0.1%
-87.74138476           184     < 0.1%
-87.61992817           170     < 0.1%
-87.63393693           146     < 0.1%
-87.62750922           140     < 0.1%
-87.74207734           138     < 0.1%
Other values (183372)  395945  99.4%

Value         Count  Frequency (%)
-87.93401422  1      < 0.1%
-87.93399393  14     < 0.1%
-87.93302828  2      < 0.1%
-87.92726168  3      < 0.1%
-87.92503562  2      < 0.1%

Value         Count  Frequency (%)
0             27     < 0.1%
-87.52458739  2      < 0.1%
-87.52458901  2      < 0.1%
-87.52464032  1      < 0.1%
-87.52467395  4      < 0.1%

LOCATION
Categorical

HIGH CARDINALITY

Distinct count  183467
Unique (%)      46.1%
Missing         0
Missing (%)     0.0%
Memory size     3.0 MiB
Value                                     Count   Frequency (%)
POINT (-87.905309125103 41.976201139024)  533     0.1%
POINT (-87.580147768689 41.791420282098)  266     0.1%
POINT (-87.585971992965 41.751460603167)  260     0.1%
POINT (-87.585275565077 41.722257273006)  212     0.1%
POINT (-87.741645644196 41.789329323265)  200     0.1%
POINT (-87.741384758605 41.754660124394)  184     < 0.1%
POINT (-87.619928173678 41.900958919109)  170     < 0.1%
POINT (-87.633936930688 41.742577617335)  146     < 0.1%
POINT (-87.627509219026 41.736380045588)  140     < 0.1%
POINT (-87.742077342959 41.792910883497)  138     < 0.1%
Other values (183457)                     395945  99.4%
 

Length

Max length     40
Median length  40
Mean length    39.77814332
Min length     11

STREET_TYPE
Categorical

Distinct count  15
Unique (%)      < 0.1%
Missing         0
Missing (%)     0.0%
Memory size     3.0 MiB
Value             Count   Frequency (%)
AVE               201619  50.6%
ST                122726  30.8%
RD                25362   6.4%
BLVD              14915   3.7%
DR                14585   3.7%
PL                3929    1.0%
NB                2936    0.7%
SB                2817    0.7%
PKWY              2726    0.7%
BROADWAY          2119    0.5%
Other values (5)  4460    1.1%

Length

Max length     8
Median length  3
Mean length    2.636430986
Min length     1

Interactions

Correlations

Pearson's r

Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. Its value lies between -1 and +1, with -1 indicating total negative linear correlation, 0 indicating no linear correlation and +1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.

To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
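
As an illustration of that definition, here is a minimal pure-Python sketch. The function name `pearson_r` and the sample data are hypothetical and not part of the report; population (divide-by-n) moments are used, which cancel in the ratio.

```python
import math

def pearson_r(x, y):
    """Covariance of x and y divided by the product of their standard deviations."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y)) / n
    std_x = math.sqrt(sum((a - mean_x) ** 2 for a in x) / n)
    std_y = math.sqrt(sum((b - mean_y) ** 2 for b in y) / n)
    return cov / (std_x * std_y)

# Made-up sample data, roughly linear.
x = [1.0, 2.0, 3.0, 4.0, 5.0]
y = [2.0, 4.1, 5.9, 8.2, 9.8]
r = pearson_r(x, y)

# Shifting and rescaling either variable (with a positive factor) leaves r unchanged.
r_shifted = pearson_r([10 * a + 3 for a in x], [0.5 * b - 1 for b in y])
print(r, r_shifted)
```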

Spearman's ρ

Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better at catching nonlinear monotonic correlations than Pearson's r. Its value lies between -1 and +1, with -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and +1 indicating total positive monotonic correlation.

To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
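
The rank-then-correlate recipe can be sketched as follows. This is an illustrative pure-Python version; the helper names (`average_ranks`, `spearman_rho`) are hypothetical, and tied values are assumed to receive the average of their ranks.

```python
import math

def average_ranks(values):
    """1-based ranks; tied values share the average of the ranks they span."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        for pos in range(i, j + 1):
            ranks[order[pos]] = (i + j) / 2 + 1
        i = j + 1
    return ranks

def pearson_r(x, y):
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y)) / n
    sx = math.sqrt(sum((a - mx) ** 2 for a in x) / n)
    sy = math.sqrt(sum((b - my) ** 2 for b in y) / n)
    return cov / (sx * sy)

def spearman_rho(x, y):
    """Pearson's r computed on the rank variables of x and y."""
    return pearson_r(average_ranks(x), average_ranks(y))

# A monotonic but nonlinear relationship (made-up data): y = x cubed.
x = [1, 2, 3, 4, 5]
y = [v ** 3 for v in x]
rho = spearman_rho(x, y)
r = pearson_r(x, y)
print(rho, r)
```

On this data ρ reaches its maximum while r stays below 1, which is the sense in which ρ catches nonlinear monotonic relationships.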

Kendall's τ

Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. Its value lies between -1 and +1, with -1 indicating total negative correlation, 0 indicating no correlation and +1 indicating total positive correlation.

To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the number of discordant pairs, divided by the total number of pairs.
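
A direct, quadratic-time sketch of that pair-counting definition is shown below. The function name `kendall_tau` is hypothetical; tied pairs are counted as neither concordant nor discordant (the variant often called tau-a), whereas libraries such as SciPy apply a tie correction by default, so results can differ when ties are present.

```python
from itertools import combinations

def kendall_tau(x, y):
    """(concordant - discordant) / total number of pairs, without tie correction."""
    pairs = list(combinations(range(len(x)), 2))
    concordant = discordant = 0
    for i, j in pairs:
        direction = (x[i] - x[j]) * (y[i] - y[j])
        if direction > 0:
            concordant += 1   # both variables move in the same direction
        elif direction < 0:
            discordant += 1   # the variables move in opposite directions
    return (concordant - discordant) / len(pairs)

# Made-up data: one swapped pair out of six gives 5 concordant, 1 discordant.
tau = kendall_tau([1, 2, 3, 4], [1, 3, 2, 4])
print(tau)
```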

Phik (φk)

Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here.

Cramér's V (φc)

Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been shown to be biased, even for large samples. We use the bias-corrected measure proposed by Bergsma in 2013.
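
For orientation, a bias-corrected Cramér's V along the lines of Bergsma (2013) can be sketched as below. This is an illustrative pure-Python version, not the report generator's own code: the function name is hypothetical, the input is assumed to be a contingency table given as a list of rows, and a plain chi-squared statistic without continuity correction is assumed.

```python
import math

def cramers_v_corrected(table):
    """Bias-corrected Cramér's V for a contingency table (list of equal-length rows)."""
    n = sum(sum(row) for row in table)
    r, k = len(table), len(table[0])
    row_totals = [sum(row) for row in table]
    col_totals = [sum(table[i][j] for i in range(r)) for j in range(k)]

    # Pearson chi-squared statistic against the independence expectation.
    chi2 = 0.0
    for i in range(r):
        for j in range(k):
            expected = row_totals[i] * col_totals[j] / n
            if expected > 0:
                chi2 += (table[i][j] - expected) ** 2 / expected

    # Bias correction of phi-squared and of the table dimensions.
    phi2_corr = max(0.0, chi2 / n - (k - 1) * (r - 1) / (n - 1))
    r_corr = r - (r - 1) ** 2 / (n - 1)
    k_corr = k - (k - 1) ** 2 / (n - 1)
    denom = min(k_corr - 1, r_corr - 1)
    return math.sqrt(phi2_corr / denom) if denom > 0 else 0.0

# Made-up tables: no association versus perfect association.
v_independent = cramers_v_corrected([[10, 10], [10, 10]])
v_perfect = cramers_v_corrected([[50, 0], [0, 50]])
print(v_independent, v_perfect)
```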

Missing values

Sample

First rows

df_indexUnnamed: 0CRASH_RECORD_IDRD_NOCRASH_DATEPOSTED_SPEED_LIMITTRAFFIC_CONTROL_DEVICEDEVICE_CONDITIONWEATHER_CONDITIONLIGHTING_CONDITIONFIRST_CRASH_TYPETRAFFICWAY_TYPELANE_CNTALIGNMENTROADWAY_SURFACE_CONDROAD_DEFECTREPORT_TYPECRASH_TYPEINTERSECTION_RELATED_IHIT_AND_RUN_IDAMAGEDATE_POLICE_NOTIFIEDPRIM_CONTRIBUTORY_CAUSESEC_CONTRIBUTORY_CAUSESTREET_NOSTREET_DIRECTIONSTREET_NAMEBEAT_OF_OCCURRENCENUM_UNITSMOST_SEVERE_INJURYINJURIES_TOTALINJURIES_FATALINJURIES_INCAPACITATINGINJURIES_NON_INCAPACITATINGINJURIES_REPORTED_NOT_EVIDENTINJURIES_NO_INDICATIONINJURIES_UNKNOWNCRASH_HOURCRASH_DAY_OF_WEEKCRASH_MONTHLATITUDELONGITUDELOCATIONSTREET_TYPE
022009e9e67203442370272e1a13d6ee51a4155dac65e583d1bdbee1fde686de7508c14ab5f205402f72644001276718917e02985561dc71b7d4bf945f09d7d47f5JA32921606/30/2017 04:00:00 PM35.0STOP SIGN/FLASHERFUNCTIONING PROPERLYCLEARDAYLIGHTTURNINGNOT DIVIDED4.0STRAIGHT AND LEVELDRYNO DEFECTSON SCENEINJURY AND / OR TOW DUE TO CRASHY0OVER $1,50006/30/2017 04:01:00 PMFAILING TO YIELD RIGHT-OF-WAYNOT APPLICABLE8301SCICERO AVE834.02.0NO INDICATION OF INJURY0.00.00.00.00.03.00.0166641.741804-87.740954POINT (-87.740953581987 41.741803598989)AVE
13300e47f189660cd8ba1e85fc63061bf1d8465184393f134fb8251ed7896a4ba9ed7c984ab51a01f564d6f4133c6ef8493b1a369743a4a308d4392900a286e160fJC19477603/21/2019 10:50:00 PM30.0TRAFFIC SIGNALFUNCTIONING PROPERLYCLEARDARKNESS, LIGHTED ROADTURNINGNOT DIVIDED4.0STRAIGHT AND LEVELDRYNO DEFECTSON SCENENO INJURY / DRIVE AWAYY0OVER $1,50003/21/2019 10:52:00 PMUNABLE TO DETERMINEUNABLE TO DETERMINE8301SCICERO AVE834.02.0NO INDICATION OF INJURY0.00.00.00.00.02.00.0225341.741804-87.740954POINT (-87.740953581987 41.741803598989)AVE
2440126747fc9ffc0edc9a38abb83d80034f897db0f739eef57f9bc75de8f2702a4c8f6dd8e49f5c2e810e1ec428bd9532fd0e6c583ca72669da9e65fc2a0a6de12JB20047803/26/2018 02:23:00 PM35.0NO CONTROLSNO CONTROLSCLEARDAYLIGHTPARKED MOTOR VEHICLENOT DIVIDEDNaNSTRAIGHT AND LEVELDRYNO DEFECTSNOT ON SCENE (DESK REPORT)NO INJURY / DRIVE AWAYNaN0$501 - $1,50003/26/2018 03:20:00 PMUNABLE TO DETERMINEUNABLE TO DETERMINE3999NAVONDALE AVE1732.02.0NO INDICATION OF INJURY0.00.00.00.00.02.00.0142341.953647-87.732082POINT (-87.732081736006 41.953646899951)AVE
3555d672ce84d5b78346be822b388604bdf9cb3fa348a5adc89501859f857e1e9308e35ececb0a56527c6e6d065cc85e4e93302c1c57a97068034116e8b23ec2f4eJD15892702/20/2020 04:45:00 PM35.0TRAFFIC SIGNALOTHERCLEARDAWNREAR ENDT-INTERSECTIONNaNSTRAIGHT AND LEVELDRYNO DEFECTSON SCENENO INJURY / DRIVE AWAYY0$501 - $1,50002/20/2020 04:51:00 PMNOT APPLICABLENOT APPLICABLE12300WIRVING PARK RD1654.02.0NO INDICATION OF INJURY0.00.00.00.00.02.00.0165241.958987-87.933994POINT (-87.933993928974 41.958986950953)RD
4660209e21f298984f7375742b7ef27c9880b485f41123a12b5a8eb14f01171abbbc05974399b985e05a352f801869cae0f41587f39a51994338fb82aeba853eeceJB41543608/30/2018 05:45:00 PM30.0TRAFFIC SIGNALFUNCTIONING PROPERLYCLEARDAYLIGHTTURNINGNOT DIVIDEDNaNSTRAIGHT AND LEVELDRYUNKNOWNNOT ON SCENE (DESK REPORT)NO INJURY / DRIVE AWAYY0OVER $1,50008/30/2018 05:58:00 PMIMPROPER OVERTAKING/PASSINGIMPROPER LANE USAGE600WDIVISION ST1822.02.0NO INDICATION OF INJURY0.00.00.00.00.02.00.0175841.903825-87.643286POINT (-87.643286359995 41.903825233976)ST
5770211e1f766f3940dfa87375661d25b716655e908c320cc46910e8fa5fb1f1e6a9d4f714d21e8e401ec9e0a12190b6cd9f6dbc97d32d0c0fc966a02ae516e782fJC30140306/11/2019 08:40:00 AM30.0TRAFFIC SIGNALFUNCTIONING PROPERLYCLEARDAYLIGHTREAR ENDDIVIDED - W/MEDIAN BARRIERNaNSTRAIGHT AND LEVELDRYNO DEFECTSNOT ON SCENE (DESK REPORT)NO INJURY / DRIVE AWAYY0$501 - $1,50006/11/2019 09:05:00 AMUNABLE TO DETERMINENOT APPLICABLE50EGARFIELD BLVD225.02.0NO INDICATION OF INJURY0.00.00.00.00.03.00.083641.794779-87.623828POINT (-87.623828038036 41.794778764028)BLVD
68802e2ed3606a50dda185f5e97c57a45552087d6fbea1c4b5f3777e0503da72279211f0518aabbeca2cd8e8ee1aca6cae3f88a0531cb62bb39ac156ca3d55e0931JB25639305/09/2018 11:30:00 AM25.0NO CONTROLSNO CONTROLSRAINDAYLIGHTANGLENOT DIVIDED2.0STRAIGHT AND LEVELWETNO DEFECTSON SCENENO INJURY / DRIVE AWAYNaN0OVER $1,50005/09/2018 11:35:00 AMFAILING TO YIELD RIGHT-OF-WAYUNABLE TO DETERMINE9511SWENTWORTH AVE511.02.0NO INDICATION OF INJURY0.00.00.00.00.02.00.0114541.721290-87.628510POINT (-87.628509593966 41.72128957001)AVE
79903c8fee8a0cb0d303e972a873228b444a47b7b1ed1e2d97a8c409203dc81a9d97abc26692d325af7428a2f8f880ab0e551e763782226b9b6a0c3e19abd7ffa23JB31741906/22/2018 07:25:00 AM35.0TRAFFIC SIGNALFUNCTIONING PROPERLYRAINDAYLIGHTTURNINGNOT DIVIDED6.0STRAIGHT AND LEVELWETNO DEFECTSON SCENEINJURY AND / OR TOW DUE TO CRASHY0OVER $1,50006/22/2018 07:27:00 AMUNABLE TO DETERMINEUNABLE TO DETERMINE8301SCICERO AVE834.02.0NONINCAPACITATING INJURY2.00.00.02.00.02.00.076641.741804-87.740954POINT (-87.740953581987 41.741803598989)AVE
8101003def753c76d0105940f82e9eaac6f1d87683b7a574c20c10c4018a8df6b96a1a02ba2a77c1a7835bb5ffb7fd1eff3179e01226f4955266d9b7bc8d44d2faf39JB24684305/02/2018 12:50:00 PM30.0NO CONTROLSNO CONTROLSCLEARDAYLIGHTOTHER OBJECTPARKING LOTNaNSTRAIGHT AND LEVELDRYNO DEFECTSON SCENEINJURY AND / OR TOW DUE TO CRASHNaN0OVER $1,50005/02/2018 12:53:00 PMUNABLE TO DETERMINEUNABLE TO DETERMINE1320E47TH ST222.01.0NONINCAPACITATING INJURY1.00.00.01.00.00.00.0124541.809781-87.594213POINT (-87.594212812011 41.809781151018)ST
91111046c0f96fdf5f7384e026821bb23fdd56d610dce11247b4cf7072f4e0308cdf5865ee8f31d71792ef005d864c064aae933213ef5e4e87a9bb2247ffe0f56f245JC12822601/24/2019 06:45:00 AM30.0NO CONTROLSNO CONTROLSCLEARDARKNESS, LIGHTED ROADREAR ENDDIVIDED - W/MEDIAN BARRIER3.0STRAIGHT AND LEVELWETNO DEFECTSNOT ON SCENE (DESK REPORT)NO INJURY / DRIVE AWAYNaN0OVER $1,50001/24/2019 03:40:00 PMEXCEEDING SAFE SPEED FOR CONDITIONSFOLLOWING TOO CLOSELY50EGARFIELD BLVD225.02.0NO INDICATION OF INJURY0.00.00.00.00.02.00.065141.794779-87.623828POINT (-87.623828038036 41.794778764028)BLVD

Last rows

df_indexUnnamed: 0CRASH_RECORD_IDRD_NOCRASH_DATEPOSTED_SPEED_LIMITTRAFFIC_CONTROL_DEVICEDEVICE_CONDITIONWEATHER_CONDITIONLIGHTING_CONDITIONFIRST_CRASH_TYPETRAFFICWAY_TYPELANE_CNTALIGNMENTROADWAY_SURFACE_CONDROAD_DEFECTREPORT_TYPECRASH_TYPEINTERSECTION_RELATED_IHIT_AND_RUN_IDAMAGEDATE_POLICE_NOTIFIEDPRIM_CONTRIBUTORY_CAUSESEC_CONTRIBUTORY_CAUSESTREET_NOSTREET_DIRECTIONSTREET_NAMEBEAT_OF_OCCURRENCENUM_UNITSMOST_SEVERE_INJURYINJURIES_TOTALINJURIES_FATALINJURIES_INCAPACITATINGINJURIES_NON_INCAPACITATINGINJURIES_REPORTED_NOT_EVIDENTINJURIES_NO_INDICATIONINJURIES_UNKNOWNCRASH_HOURCRASH_DAY_OF_WEEKCRASH_MONTHLATITUDELONGITUDELOCATIONSTREET_TYPE
398184416188416188e03b02d64d5d4db07c0c24fe1e9c495b4ac5b67cbe1e7334d16533b47b2a410bf1481830ec85ab046c4b391bf689476f68af6df1437f9e21cd10cacb10f7308aJD23664105/19/2020 01:00:00 PM25.0NO CONTROLSNO CONTROLSCLEARDAYLIGHTREAR ENDONE-WAYNaNSTRAIGHT AND LEVELWETNO DEFECTSNOT ON SCENE (DESK REPORT)NO INJURY / DRIVE AWAYNaN1$501 - $1,50005/19/2020 02:30:00 PMNOT APPLICABLENOT APPLICABLE1000SKOLMAR AVE1131.02.0NO INDICATION OF INJURY0.00.00.00.00.02.00.0133541.869118-87.739401POINT (-87.739401314183 41.869117777281)AVE
398185416189416189f198c65371434926c14b2a63ed8792a1389fadaf675d8ee368dc5aaf1a30467a894c1412b2f099985c04308755ad76dfd1589de86c674ff6050c527539baee0dJD23680205/19/2020 03:31:00 PM30.0TRAFFIC SIGNALFUNCTIONING PROPERLYCLEARDAYLIGHTTURNINGFOUR WAYNaNSTRAIGHT AND LEVELDRYNO DEFECTSON SCENENO INJURY / DRIVE AWAYY0$501 - $1,50005/19/2020 03:51:00 PMIMPROPER TURNING/NO SIGNALNOT APPLICABLE2001EMARQUETTE DR331.02.0NO INDICATION OF INJURY0.00.00.00.00.02.00.0153541.775563-87.576444POINT (-87.576444451178 41.775562727526)DR
398186416190416190f52f00333e5cac1530939b110da25a72de89372c032ae422467081eb937a8ff841ff2f5d73c28ededd41b765fad1e5681c1b039b5c02435bb9edb9ee97c94eabJD23694005/19/2020 06:21:00 PM30.0NO CONTROLSNO CONTROLSCLOUDY/OVERCASTDAYLIGHTSIDESWIPE SAME DIRECTIONDIVIDED - W/MEDIAN (NOT RAISED)NaNSTRAIGHT AND LEVELDRYNO DEFECTSON SCENENO INJURY / DRIVE AWAYNaN0$500 OR LESS05/19/2020 06:21:00 PMFAILING TO YIELD RIGHT-OF-WAYNOT APPLICABLE2473NCLARK ST1935.02.0NO INDICATION OF INJURY0.00.00.00.00.03.00.0183541.927478-87.641450POINT (-87.641449775903 41.927477726852)ST
398187416191416191d97477e587b8eecbdb0dbce0a0c5462e7335a4f7da2e04c8e4799193cba69f0b887de56a831e8899031c1f7175e82a2ad8c43ba34260181a3e4e1bafda37ea88JD23402705/16/2020 11:45:00 AM40.0NO CONTROLSNO CONTROLSCLEARDAYLIGHTSIDESWIPE SAME DIRECTIONDIVIDED - W/MEDIAN BARRIERNaNCURVE, LEVELDRYNO DEFECTSON SCENEINJURY AND / OR TOW DUE TO CRASHNaN0OVER $1,50005/16/2020 11:55:00 AMFAILING TO YIELD RIGHT-OF-WAYNOT APPLICABLE120NLAKE SHORE DR SB114.02.0NO INDICATION OF INJURY0.00.00.00.00.02.00.0117541.883642-87.615441POINT (-87.615441182232 41.883641778749)SB
398188416192416192e43ffefbcb0c4518869aac0447be15af688b10f56d652adb93a204fc18d823594c6cf5e64bcf12134ee2e91a692e0434118a1367b5a5904e448f4cdbc850bfb5JD23494605/17/2020 01:22:00 PM30.0NO CONTROLSNO CONTROLSRAINDAYLIGHTPARKED MOTOR VEHICLEDIVIDED - W/MEDIAN BARRIERNaNSTRAIGHT AND LEVELWETUNKNOWNON SCENEINJURY AND / OR TOW DUE TO CRASHNaN0$501 - $1,50005/17/2020 01:22:00 PMIMPROPER LANE USAGEFAILING TO REDUCE SPEED TO AVOID CRASH1924WGARFIELD BLVD932.03.0NO INDICATION OF INJURY0.00.00.00.00.01.00.0131541.794040-87.673019POINT (-87.673018619292 41.79403960152)BLVD
398189416193416193de4cbde297443228427759c6d58f40254dc8d8fcb201dc731da85d4d0dcd05fd2fbb10974bc0d17cdecd4128959af5b2fc2a4b8a0ee42f3370be201aba68a5bdJD23565105/18/2020 12:20:00 PM10.0NO CONTROLSNO CONTROLSCLEARDAYLIGHTREAR TO SIDEPARKING LOTNaNSTRAIGHT AND LEVELDRYNO DEFECTSNOT ON SCENE (DESK REPORT)NO INJURY / DRIVE AWAYNaN0$500 OR LESS05/18/2020 12:32:00 PMFAILING TO YIELD RIGHT-OF-WAYUNABLE TO DETERMINE1527WLAWRENCE AVE1912.02.0NO INDICATION OF INJURY0.00.00.00.00.02.00.0122541.968773-87.668624POINT (-87.668624408172 41.968772794281)AVE
398190416194416194eb65bb1c2f912fbd1a80406813bbe831df2083d483334697387d443e911400cfad731b727f3ab2ab982ab9583e12ef1bba31838e50a2d4c3979a28223b8ef00fJD23572505/18/2020 09:00:00 AM30.0UNKNOWNUNKNOWNUNKNOWNUNKNOWNPARKED MOTOR VEHICLEUNKNOWNNaNSTRAIGHT AND LEVELUNKNOWNUNKNOWNNOT ON SCENE (DESK REPORT)INJURY AND / OR TOW DUE TO CRASHNaN1OVER $1,50005/18/2020 01:48:00 PMUNABLE TO DETERMINENOT APPLICABLE4015NPARKSIDE AVE1624.02.0NO INDICATION OF INJURY0.00.00.00.00.01.00.092541.953658-87.768095POINT (-87.768095363598 41.953657967626)AVE
398191416195416195d35263055740651327a467ea3b5181fde21e1edad1e917c4096a7ca360f3ef47a2d3dc69666c6fc9bdfe7a9fdf8420654f251fcdf7c657bddc51a3a263e6eaf5JD23650405/19/2020 02:00:00 AM20.0NO CONTROLSNO CONTROLSRAINDARKNESS, LIGHTED ROADPARKED MOTOR VEHICLEONE-WAYNaNSTRAIGHT AND LEVELWETNO DEFECTSNOT ON SCENE (DESK REPORT)NO INJURY / DRIVE AWAYNaN0$500 OR LESS05/19/2020 11:00:00 AMUNABLE TO DETERMINEUNABLE TO DETERMINE5700NRICHMOND ST2011.02.0NO INDICATION OF INJURY0.00.00.00.00.01.00.023541.985008-87.703007POINT (-87.70300710512 41.985008248097)ST
398192416196416196d613f48076a4221ebaebf9fc31bf396baaf18892529f3f97474e6795d01eb9545440ed9dc0d31201c554798cce001416f0aff7f4214f3848bdb211d8f89915afJD23648705/19/2020 09:40:00 AM30.0NO CONTROLSNO CONTROLSCLEARDAYLIGHTPARKED MOTOR VEHICLENOT DIVIDEDNaNSTRAIGHT AND LEVELDRYNO DEFECTSNOT ON SCENE (DESK REPORT)NO INJURY / DRIVE AWAYNaN0$501 - $1,50005/19/2020 10:36:00 AMUNABLE TO DETERMINEUNABLE TO DETERMINE7715SCOTTAGE GROVE AVE624.02.0NO INDICATION OF INJURY0.00.00.00.00.01.00.093541.754447-87.605136POINT (-87.605136112415 41.75444668619)AVE
398193416197416197f247683bbffde5cd7d1bddad30ff42c2d3d4594ffd218dc9bdbb270b88edac4d3e2c87a077ea21bede21d27eb2d508c8c9e0a42595a81d1221035e85aeea04cdJD23679605/19/2020 01:32:00 PM10.0NO CONTROLSNO CONTROLSCLEARDAYLIGHTOTHER OBJECTALLEYNaNSTRAIGHT AND LEVELDRYNO DEFECTSNOT ON SCENE (DESK REPORT)NO INJURY / DRIVE AWAYNaN0OVER $1,50005/19/2020 03:46:00 PMUNABLE TO DETERMINEUNABLE TO DETERMINE925SINDEPENDENCE BLVD1133.01.0NO INDICATION OF INJURY0.00.00.00.00.01.00.0133541.869263-87.719477POINT (-87.719476770345 41.869263306188)BLVD